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A DNA segment having a sequence encoding a protein 
associated with attaching and effacing activity of 
enterohemorrhagic E.coli r or an oligonucleotide fragment 
thereof is provided. The DNA segment permits selection of 
DNA and amino acid sequences unique to a protein 
associated with attaching and effacing activity of 
enterohemorrhagic E.coli . Novel DNA segments or proteins 
can thus be constructed which contain the unique DNA and 
amino acid sequences. The DNA segments or oligonucleotide 
fragments thereof may be use to detect the presence of a 
DNA segment having a sequence encoding a protein 
associated with attaching and effacing activity of 
enterohemorrhagic E.coli . Selected DNA segments or 
oligonucleotide fragments thereof may be used in methods 
for detecting pathogenic verotoxin-producing E.coli . 
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Title: AttaRiing and Effacing Protein of Enterohemorrhagic 



B.coli 



FIELD OF THE INVENTION 

The present invention relates to a DNA segment having a 
sequence encoding a protein associated with attaching and 
effacing activity of enterohemorrhagic E.coli . or an 
oligonucleotide fragment thereof. The DNA segment of the 
invention can be used to produce a protein associated with 
attaching and effacing activity of enterohemorrhagic 
E .coli or a part thereof, by culturing a trans formant host 
cell which includes a recombinant molecule comprising a 
DNA segment of the invention or an oligonucleotide 
fragment thereof and an expression control sequence 
operatively linked to the DNA segment. 

The DNA segment of the invention also permits selection of 
DNA and amino acid sequences unique to a protein 
associated with attaching and effacing activity of 
enterohemorrhagic E .coli . Novel DNA segments or proteins 
can thus be constructed which contain the unique DNA and 
amino acid sequences . The invention also relates to uses 
of the DNA segments encoding a protein associated with 
attaching and effacing activity of enterohemorrhagic 
E.coli . and fragments thereof, and uses of a protein 
associated with attaching and effacing activity of 
enterohemorrhagic E .coli or parts thereof. 

BACKGROUND OF THE INVENTION 

Enteropathogenic Escherichia coli (EPEC) colonize the 
intestine of humans (Ulshen, M. , and Rollo J. (1980) N . 
Engl. J. Med. 302, 99-102) and experimental animals, 
(Tzipori, S., Robbins-Brown, R.M., Gonis , G. , Hayes, J., 
Withers, M* , and McCartney, E. (1985) Gut 26, 570-578) 
producing a characteristic lesion which has been called 
attaching and effacing (AE) (Moon, H.W., Whipp, S.C., 
Argenzio, R.A., Levin, M.M., and Gianella, R. A . 1983. 



Infect. ImiSP 41, 1340-1351). The lesion ^^sists of an 

extremely close (lOnm) approximation of the bacterial 
outer membrane to the intestinal epithelial cell membrane. 
The microvilli of the enterocyte are effaced and there is 

5 polymerization of actin in the cytoplasm subjacent to the 

area of bacterial attachment (Knutton, S. , Baldwin, T., 
Williams, P.H., and McNeish, A.S. (1989) Infect. Immun. 
57, 1290-1298). The AE lesion is also produced by 
enterohemorrhagic E.coli (EHEC) of serogroups 026, 0111 

!0 and 0157 (Tzipori, S. , Wachsmuth, I., Smithers, J., and 

Jackson, C, (1988) Gastroenterology 94, 590-597; Tzipori, 
S., Wachsmuth, I.K., Chapman, C, Birner, R. , Brittingham, 
J., Jackson, C, and Hogg, J. (1986) J. Infect. Dis . 154, 
712-716) . 

15 

Escherichia coli serotype 0157:H7 is the most important 
member of the group of E.coli which produce verotoxins and 
is a major public health concern. Among the verotoxin- 
producing E.coli , serotype 0157 :H7 is most commonly 

20 associated with human disease. This serotype is a 

recognized cause of hemorrhagic colitis, occurring both 
sporadically and in major outbreaks, and is also a cause 
of the hemolytic uremic syndrome especially among children 
and the elderly. These organisms are probably acquired by 

25 humans from an animal source via contaminated meat and 

occasionally unpasteurized dairy products. Outbreaks of 
disease caused by 0157 :H7 have been associated with 
consumption of undercooked hamburgers and raw milk (Riley 
LW, Remis RS, Helgerson SD, et al. N Engl J Med 1983; 

30 308:681-5; Borczyk AA, Karmali MA, Lior H, et al . 

(Letter). Lancet 1987; 1:98; Duncan L, Mai V, Carter A, 
et al. Outbreak of gastrointestinal disease - Ontario. Can 
Dis Wkly Rep 1987; 13-2:5-8). 

35 A method that would identify these organisms in animal and 

food samples would be very useful for screening infected 
herds and contaminated food. Presently, the detection of 
verotoxin and DNA probes for verotoxin do not distinguish 
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verotoxin-producing E . coli which are important causes of 
human disease such as 0157 :H7 from those which, although 
present in animals, rarely if ever cause human disease. 

Jerse et al. ((1990) Proc . Natl. Acad. Sci. USA 87, 7839- 
7843) and Donnenberg et al . ((1990) Infect. Immun. 58, 
1565-1571) have identified a gene in the EPEC strain 
E2348/69 which was necessary but not sufficient for the 
formation of the AE lesion. This gene of approximately 
3kb, which was designated eae, encodes a 94 kilodalton 
outer membrane protein (Jerse, A.E. and Kaper, J.B. (1991) 
Abst. Ann. Am. Soc . Microbiol. B-112, p. 44). It was shown 
that E2348/69 mutants carrying transposon TnPhoA 
insertions in the eae gene were unable to produce the AE 
lesion when ^incubated with HEp-2 cells (Donnenberg M. , 
Calderwood, S., Donohue-Rolf e, A., Keusch, G.T., and 
Kaper, J. (1990) Infect. Immun. 58, 1565-1571). A Ikb 
Stul-Sall fragment which encompasses the central one third 
of the EPEC eae gene was found to hybridize with DNA 
isolated from bacteria of classical EPEC serogroups as 
well as with DNA isolated from EHEC of serogroups 026 and 
0157 (Jerse, A., Yu, J., Tall, B. , and Kaper, J. (1990) 
Proc. Natl. Acad. Sci. USA 87, 7839-7843). Although both 
EPEC and EHEC strains cause ultrastructurally similar 
lesions, as shown in gnotobiotic piglets (Moon, H.W. , 
Whipp, S.C., Argenzio, R.A., Levin, M.M., and Gianella, 
R . A . 1983. Infect. Immun. 41, 1340-1351; Tzipori, S., 
Wachsmuth, I.K., Chapman, C, Birner, R., Brittingham, J., 
Jackson, C, and Hogg, J. (1986) J. Infect. Dis . 154, 712- 
716) and in tissue culture cells (Knutton, S., Baldwin, 
T. , Williams, P.H. , and McNeish, A.S. (1989) Infect. 
Immun. 57, 1290-1298), the two types of infection can be 
differentiated by anatomic site of involvement, suggesting 
differences in adherence factors. 

SUMMARY OF THE INVENTION 

The present invention provides a purified and isolated DNA 
segment having a sequence which codes for a protein 
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Iktivity of 
Kodiment , a 



purified and isolated DNA segment is provided having a 
sequence which codes for a protein associated with 
attaching and effacing activity of enterohemorrhagic 
E » coli having an amino acid sequence which has substantial 
homology with the amino acid sequence as shown in the 
Sequence Listing as SEQ ID NO: 1. Most preferably, the 
purified and isolated DNA segment has a sequence having 
substantial sequence homology with the nucleotide sequence 
as shown in the Sequence Listing as SEQ ID N0:1, 

The invention also relates to a recombinant molecule 
adapted for transformation of a host cell comprising the 
DNA segment of the invention operatively linked to an 
expression control sequence. A transformant host cell 
including a recombinant molecule of the invention is also 
provided. Still further, this invention provides plasmids 
which comprise the recombinant molecules of the invention. 

The present invention further relates to an avirulent 
strain of an enterohemorrhagic E.coli comprising an 
avirulent bacterial carrier strain transformed with a 
recombinant molecule of the invention, and a vaccine 
composition comprising a bacterial carrier strain 
transformed with a recombinant molecule of the invention. 

The invention also provides a method of preparing a 
protein associated with attaching and effacing activity of 
enterohemorrhagic E .coli f utilizing a DNA segment of the 
invention. The method comprises culturing a transformant 
host cell including a recombinant molecule comprising a 
DNA segment of the invention and an expression control 
sequence operatively linked to the DNA segment, in a 
suitable medium until the protein is formed and thereafter 
isolating the protein. 
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The inven 




still further- provides a sub" 



tially pure 



protein associated with attaching and effacing activity of 
enterohemorrhagic E . coli . In a preferred embodiment the 
protein has an amino acid sequence which has substantial 
homology with the amino acid sequence as shown in the 
Sequence Listing as SEQ ID NO:l. 

The invention also relates to antibodies specific for an 
epitope of a protein of the invention, preferably a 
monoclonal antibody, and a method for preparing the 
antibodies . 

A diagnostic kit for detecting enterohemorrhagic E.coli , 
preferably E .coli 0157 in a sample comprising a monoclonal 
antibody of the invention and directions for its use is 
also provided. 

The invention also contemplates an immunoassay for the 
detection of enterohemorrhagic E.coli , preferably E.coli 
0157, comprising contacting a sample suspected of 
containing enterohemorrhagic E .coli with a monoclonal 
antibody of the invention to form an immune complex; and 
determining the presence of the immune complex. 

The DNA segments of the invention or oligonucleotide 
fragments of the DNA segments, allow those skilled in the 
art to construct nucleotide probes for use in the 
detection of nucleotide sequences in samples such as 
biological or food samples. The nucleotide probes may be 
used to detect nucleotide sequences that encode proteins 
related to or analogous to the attaching and effacing 
protein of the invention. 

Accordingly, the invention provides a method for detecting 
the presence of a DNA segment having a sequence encoding 
a protein related to or analogous to a protein associated 
with attaching and effacing activity of enterohemorrhagic 
E.coli or an oligonucleotide fragment thereof in a sample, 
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comprising contacting the sample with a nucKotide probe 
capable of hybridizing with the DNA segment or an 
oligonucleotide fragment thereof, to form a hybridization 
product, under conditions which permit the formation of 
the hybridization product, and assaying for the 
hybridization product. 



10 



The invention further provides a kit for detecting the 
presence of a DNA segment having a sequence encoding a 
protein related to or analogous to a protein associated 
with attaching and effacing activity of enterohemorrhagic 
E.coli or an oligonucleotide fragment thereof in a sample, 
comprising a nucleotide probe capable of hybridizing with 
the DNA segment or an oligonucleotide fragment thereof, 
15 reagents required for hybridization of the nucleotide 

probe with the DNA segment or an oligonucleotide fragment 
thereof, and directions for its use. 



20 



25 



30 



The DNA segment of the invention also permits the 
identification and isolation, or synthesis, of nucleotide 
sequences which may be used as primers to amplify a DNA 
segment of the invention or an oligonucleotide fragment 
thereof, for example in the polymerase chain reaction 
(PCR) . 



Accordingly, the invention relates to a method of 
determining the presence of a DNA segment having a 
sequence encoding a protein associated with attaching and 
effacing activity of enterohemorrhagic E.coli or a 
predetermined oligonucleotide fragment thereof in a 
sample, comprising treating the sample with primers which 
are capable of amplifying the DNA segment or the 
predetermined oligonucleotide fragment thereof, in a 
polymerase chain reaction to form amplified sequences, 
35 under conditions which permit the formation of amplified 

sequences, and, assaying for amplified sequences. 
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The invention further relates to a kit for determining the 
presence of a DNA segment having a sequence encoding a 
protein associated with attaching and effacing activity of 
enterohemorrhagic E .coli or a predetermined 
oligonucleotide fragment thereof in a sample, comprising 
primers which are capable of amplifying the DNA segment or 
the predetermined oligonucleotide fragment thereof in a 
polymerase chain reaction to form amplified sequences, 
means for assaying the amplified sequences, and directions 
for its use. 



The present inventors have designed specific nucleotide 
sequences for detecting pathogenic verotoxin-producing 
E . coli strains . The present invention therefore further 
provides a purified and isolated nucleotide sequence 
comprising the sequence shown in the Sequence Listing as 
SEQ ID NO: 2; the sequence shown in the Sequence Listing as 
SEQ ID NO: 3; the sequence shown in the Sequence Listing as 
SEQ ID NO: 4, the sequence shown in the Sequence Listing as 
SEQ ID NO:5; the sequence shown in the Sequence Listing as 
SEQ ID NO: 6; the sequence shown in the Sequence Listing as 
SEQ ID NO:7; the sequence shown in the Sequence Listing as 
SEQ ID NO: 8; the sequence shown in the Sequence Listing as 
SEQ ID NO: 9; a sequence having substantial homology 
thereto, or a fragment of the nucleotide sequence. 

Accordingly, the present invention relates to a method for 
detecting verotoxin-producing E .coli of the serogroups 05, 
026, 0103, 0111, 0118, 0145, and 0157 in a sample 
comprising contacting the sample with a Sall-StuI fragment 
comprising the sequence as shown in the Sequence Listing 
as SEQ ID NO: 2 and the sequence as shown in the Sequence 
Listing as SEQ ID NO: 3, under conditions which permit the 
fragment to hybridize with a complementary sequence in the 
sample to form a hybridization product, and assaying for 
the hybridization product. 
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The invention also contemplates a method detecting 
vero toxin-producing E.coli serogroup 0157 in a sample 
comprising contacting the sample with a 0.5 kb fragment 
comprising the sequence as shown in the Sequence Listing 
as SEQ ID NO: 4 and the sequence as shown in the Sequence 
Listing as SEQ ID NO: 5, under conditions which permit the 
fragment to hybridize with a complementary sequence in the 
sample to form a hybridization product, and assaying for 
the hybridization product. 



A method for detecting pathogenic verotoxin-producing 
E . coli strains of the serogroup 0157 in a sample is also 
provided comprising treating the sample with the 
nucleotide sequence P1EH shown in the Sequence Listing as 

15 SEQ ID NO: 6 and the nucleotide sequence P2EH5 shown in the 

Sequence Listing as SEQ ID NO: 7 in the polymerase chain 
reaction, under conditions which permit the formation of 
amplified sequences which terminate with the nucleic acid 
sequence of one of the primers and the complementary 

20 nucleic acid sequence of the other of the primers, and, 

assaying for the amplified sequences . 

The invention further provides a kit for detecting 
verotoxin-producing E.coli of the serogroups 05, 026, 

25 0103, 0111, 0118, 0145, and 0157 in a sample comprising a 

Sall-StuI fragment comprising the sequence as shown in the 
Sequence Listing as SEQ ID NO: 2 and the sequence as shown 
in the Sequence Listing as SEQ ID NO: 3, reagents required 
for hybridization of the fragment with a DNA segment 

30 encoding the attaching and effacing protein of 

enterohemorrhagic E .coli or an oligonucleotide fragment 
thereof, and directions for its use. 

The invention also relates to a kit for detecting 
35 verotoxin-producing E.coli of the serogroup 0157 in a 

sample is comprising a 0.5 kb fragment (A3-B2) comprising 
the sequence as shown in the Sequence Listing as SEQ ID 
NO: 4 and the sequence as shown in the Sequence Listing as 
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SEQ ID NO: 5/ reagents required for hybridization of the 
fragment with a DNA segment encoding the attaching and 
effacing protein of enterohemorrhagic E . coli or an 
oligonucleotide fragment thereof, and directions for its 
use . 

The invention further relates to a kit for detecting 
pathogenic vero toxin-producing E .coli strains of the 
serogroup 0157 in a sample comprising the primers P1EH as 
shown in the Sequence Listing as SEQ ID NOs6 and P2EH as 
shown in the Sequence Listing as SEQ ID NO:7, 
respectively, reagents required for the amplification in 
a PCR reaction of sequences which terminate with the 
nucleic acid sequence of one of the primers and the 
complementary nucleic acid sequence of the other of the 
primers, and directions for its use, 

DESCRIPTION OF THE DRAWINGS 

The invention will now be described in relation to the 
drawings in which: 

Figure 1 shows the DNA sequence of the EHEC eae gene and 
the deduced protein sequence starting at the start codon 
at 206 bp; 

Figure 2 shows the homology between the deduced protein 
sequences of the EPEC and EHEC eae genes; 

Figure 3 shows the amino acid sequence alignment of the 
predicted EPEC and EHEC proteins; 

Figure 4 shows the results of colony blot hybridizations 
of various O serogroups using A) a central (C1-C2) 1 kb 
fragment and B) a 3 1 (A3-B2) 0,5 fragment of the 0157 :H7 
eae gene; 



DOT'iTi:.) 



Figure 5 shows the results of colony blot 





uridizations 
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30 



of various 0 serogroups using a 0.45 kb fragment (P1EH- 
P2EH) of the 0157 :H7 eae gene; 

Figure 6 shows the primer locations on the 3 ' region of 
EHEC eae gene; 

Figure 7 shows the amplification products of VTEC and EPEC 
using specific primer sets A3-B2 (0.5 kb) and P1EH-P2EH 
(0.45 kb); 

Figure 8 shows the results of colony blot hybridizations 
of serogroups 0157 (VT+), 0157 (VT-), 05, 026, 0111, 0113, 
0121, and 0127 with [a 32 P] dATP end-labeled with P2EH 
oligonucleotide ; and 

Figure 9 shows the alignment of the deduced amino acid 
sequences at the 3* end of: 0157: EHEC eae 0157 :H7 
protein; Ollls EHEC eae 0111 :H8 protein; EPEC: EPEC eae 
0127:H6 protein; and INV : Y . pseudotuberculosis invasin. 

DETAILED DESCRIPTION OF THE INVENTION 

The present inventors have identified a chromosomal eae 
gene associated with attaching and effacing activity in 
enterohemorrhagic E .coli (EHEC). The EHEC eae gene was 
identified from EHEC serotype 0157 :H7 strain CL-8. The 
entire predicted structural gene was sequenced including 
205 bp upstream and 118 bp downstream. Figure 1 and the 
sequence shown in the Sequence Listing as SEQ ID N0:1, 
show the DNA sequence of the EHEC eae gene and the deduced 
protein sequence starting at the start codon at 206 bp. 
Comparison of nucleotide sequence similarity between the 
EHEC and EPEC eae genes shows 97% homology in the first 
2200 bp and only 59% homology in the last 800 bp (Figure 
2). The amino acid sequence alignment of the predicted 
EPEC and EHEC proteins is shown in Figure 3, The N- 
terminal amino acids are highly conserved for the first 
685 amino acids while for the C-terminal amino acids, only 
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s can be aligned. There is 



Nervation of 



cysteine residues at positions 859 and 933 of the 
alignment . 

Both the EHEC and EPEC sequences show similarity to the 
Yersinia pseudotuberculosis invasin gene (Isberg, R.R. et 
al., (1987) Cell 50, p. 769) with greatest divergence at 
the C-terminus. The C-terminal end of the Yersinia inv 
gene is associated with receptor binding (Leong et al , 
EMBO J 1990; 6:1979-89) and it is possible that the same 
applies to the eae gene products . 

The inventors also probed vero toxin-producing E .coli 
(VTEC) (251 strains) of multiple O-serogroups with a 
highly conserved Ikb Stu l- Sal l central fragment from the 
EHEC eae gene using colony hybridization. Only the 
following VTEC serogroups (125 strains) were found to be 
positive: 05, 026, 0103, 0111, 0118, 0145, and 0157. 
Using the polymerase chain reaction (PCR) a less conserved 
0 . 5kb probe was generated from the 3 ' end of the EHEC eae 
gene. Among the 251 VTEC strains tested using colony 
hybridization, only those strains belonging to the 0157 
serogroup were positive. Additionally, the 3* end of eae 
genes of VTEC strains appear to have unique nucleotide 
sequences as determined on selected O-serogroups . These 
O-serogroups initially hybridized with the highly 
conserved central regions of the eae genes . 

Therefore, the present inventors have shown that the 
central 1 kb fragment of the EHEC eae gene is homologous 
in a variety of 0 serogroups of verotoxin-producing E .coli 
(VTEC) strains but that there is diversity at the C- 
terminal end. The heterogeneity at the 3* end of eae genes 
allows for specific detection of clinically relevant VTEC 
strains and may be associated with differences in receptor 
binding . 
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Accordingly, the present invention provides a purified and 
isolated DNA segment having a sequence which codes for a 
protein associated with attaching and effacing activity of 
enterohemorrhagic E .coli . 

In a preferred embodiment, a purified and isolated DNA 
segment is provided having a sequence which codes for a 
protein associated with attaching and effacing activity of 
enterohemorrhagic E * coli having an amino acid sequence 
which has substantial homology with the amino acid 
sequence as shown in the Sequence Listing as SEQ ID NO:l. 
Most preferably, the purified and isolated DNA segment has 
a sequence having substantial sequence homology with the 
nucleotide sequence as shown in the Sequence Listing as 
SEQ ID NO:l. 

DNA segments of the present invention encoding a protein 
associated with attaching and effacing activity of EHEC or 
related or analogous sequences may be isolated and 
sequenced, by selectively amplifying the region of the eae 
gene using the polymerase chain reaction method and 
genomic DNA. It is possible to design synthetic 
oligonucleotide primers from the sequence shown in the 
Sequence Listing as SEQ ID NO:l for use in PCR and for 
screening genomic libraries. An amplified fragment can be 
cloned and characterized by DNA sequence analysis. DNA 
segments of the present invention encoding the protein 
associated with attaching and effacing activity of EHEC 
may also be constructed by chemical synthesis and 
enzymatic ligation reactions using procedures known in the 
ar t t . 

It will be appreciated that the invention includes 
nucleotide or amino acid sequences which have substantial 
sequence homology with the nucleotide and amino acid 
sequences shown in the Sequence Listing as SEQ ID NOsl. 
The term "sequences having substantial sequence homology" 
means those nucleotide and amino acid sequences which have 
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slight or inconsequential sequence variations from the 
sequences disclosed in the Sequence Listing as SEQ ID NO:l 
i.e. the homologous sequences function in substantially 
the same manner to produce substantially the same 
polypeptides as the actual sequences . The variations may 
be attributable to local mutations or structural 
modifications . 

It will also be appreciated that a double stranded 
nucleotide sequence comprising a DNA segment of the 
invention as shown in Figure 1 or an oligonucleotide 
fragment thereof, hydrogen bonded to a complementary 
nucleotide base sequence (see Sequence Listing SEQ ID 
NO:l), an RNA made by transcription of this doubled 
stranded nucleotide sequence, and an antisense strand of 
a DNA segment of the invention or an oligonucleotide 
fragment of the DNA segment, are contemplated within the 
scope of the invention. 

A number of unique restriction sequences for restriction 
enzymes are incorporated in the DNA segment identified in 
the Sequence Listing as SEQ ID NOil, and these provide 
access to nucleotide sequences which code for polypeptides 
unique to the protein associated with attaching and 
effacing activity of EHEC. DNA sequences unique to the 
protein associated with attaching and effacing activity of 
EHEC or isoforms thereof, can also be constructed by 
chemical synthesis and enzymatic ligation reactions 
carried out by procedures known in the art. 

The DNA segments of the invention or oligonucleotide 
fragments of the DNA segments, allow those skilled in the 
art to construct nucleotide probes for use in the 
detection of nucleotide sequences in samples such as 
biological specimens or food samples, A nucleotide probe 
may be labelled with a detectable marker such as a 
radioactive label which provides for an adequate signal 
and has sufficient half -life such as 32 P, 3 H, 14 C or the 
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like. Other labels which may be used include antigens 
that are recognized by a specific labelled antibody, 
fluorescent compounds, enzymes, antibodies specific for a 
labelled antigen, and chemiluminescent compounds. An 
appropriate label may be selected having regard to the 
rate of hybridization and binding of the probe to the 
nucleotide to be detected and the amount of nucleotide 
available for hybridization. 

The nucleotide probes may be used to detect DNA segments 
having sequences that encode proteins related to or 
analogous to the protein associated with attaching and 
effacing activity of EHEC . 

Accordingly the present invention also relates to a method 
for detecting the presence of a DNA segment having a 
sequence encoding a protein related to or analogous to a 
protein associated with attaching and effacing activity of 
entero hemorrhagic E . coli or an oligonucleotide fragment 
thereof in a sample, comprising contacting the sample with 
a nucleotide probe capable of hybridizing with the DNA 
segment or an oligonucleotide fragment thereof to form a 
hybridization product, under conditions which permit the 
formation of the hybridization product, and assaying for 
the hybridization product. The nucleotide probe may be 
labelled with a detectable marker as described herein and 
the hybridization product may be assayed by detecting the 
detectable marker or the detectable change produced by the 
detectable marker. 

The invention also provides a kit for detecting the 
presence of a DNA segment having a sequence encoding a 
protein related to or analogous to a protein associated 
with attaching and effacing activity of enterohemorrhagic 
E . coli or an oligonucleotide fragment thereof in a sample, 
comprising a nucleotide probe capable of hybridizing with 
the DNA segment or an oligonucleotide fragment thereof, 
reagents required for hybridization of the nucleotide 
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probe with DNA segment or an oligonucl de fragment 

thereof, and directions for its use. 

The DNA segment of the invention permits the 
identification and isolation, or synthesis of nucleotide 
sequences which may be used as primers to amplify a DNA 
segment of the invention or an oligonucleotide sequence 
thereof, for example in the polymerase chain reaction 
(PCR). The length and bases of the primers for use in the 
PCR are selected so that they will hybridize to different 
strands of the desired sequence and at relative positions 
along the sequence such that an extension product 
synthesized from one primer when it is separated from its 
template can serve as a template for extension of the 
other primer into a nucleic acid of defined length* 

The primers may be prepared using techniques known in the 
art such as for example phosphotriester and phosphodiester 
methods or automated techniques. Restriction endonuclease 
digests may also be used as primers. 

It will be appreciated that the primers may contain non- 
complementary sequences provided that a sufficient amount 
of the primer contains a sequence which is complementary 
to the DNA segment of the invention or oligonucleotide 
sequence thereof, which is to be amplified. Restriction 
site linkers may also be incorporated into the primers 
allowing for digestion of the amplified products with the 
appropriate restriction enzymes facilitating cloning and 
sequencing of the amplified product. 

It will also be appreciated that the invention includes 
nucleotide sequences which have substantial sequence 
homology with the above-mentioned nucleotide probes and 
primers. Further, it will be appreciated that a double 
stranded nucleotide sequence comprising a nucleotide probe 
or primer, hydrogen bonded to a complementary nucleotide 
base sequence, an RNA made by transcription of this 
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doubled st ed nucleotide sequence, an^j^h antisense 
strand of a nucleotide probe or primer, are contemplated 
within the scope of the invention, 

In an embodiment of the invention a method of determining 
the presence of a DNA segment having a sequence encoding 
a protein associated with attaching and effacing activity 
of enterohemorrhagic E.coli or a predetermined 
oligonucleotide fragment thereof in a sample, is provided 
comprising treating the sample with primers which are 
capable of amplifying the DNA segment or the predetermined 
oligonucleotide fragment thereof in a polymerase chain 
reaction to form amplified sequences, under conditions 
which permit the f ormation of amplified sequences and, 
assaying for amplified sequences. 



The polymerase chain reaction refers to a process for 
amplifying a target nucleic acid sequence as generally 
described in Innis et al, Academic Press, 1990 and U.S. 
Patent 4,800,159. Generally, amplification of the target 
nucleic acid sequence may be accomplished by means of a 
pair of primers which flank the nucleic acid sequence to 
be amplified. The primers hybridize to opposite strands 
of the target sequence and DNA synthesis proceeds across 
the region between the primers, thereby doubling the 
amount of that DNA segment. Repeated cycles of 
denaturation, priming and extension permit rapid 
exponential amplification of the target sequence. 

The amplified products can be isolated and distinguished 
based on their respective sizes using techniques known in 
the art. For example, after amplification, the DNA sample 
can be separated on an agarose gel and visualized, after 
staining with ethidium bromide, under ultra violet (UV) 
light. DNA may be amplified to a desired level and a 
further extension reaction may be performed to incorporate 
nucleotide derivatives having detectable markers such as 
radioactive labelled or biotin labelled nucleoside 
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triphosphates. The detectable markers may be analyzed by 
restriction and electrophoretic separation or other 
techniques known in the art. 

The conditions which may be employed in the methods of the 
invention using PCR are those which permit hybridization 
and amplification reactions to proceed in the presence of 
DNA in a sample and appropriate complementary 
hybridization primers. Conditions suitable for the 
polymerase chain reaction are generally known in the art. 
Preferably , the PCR utilizes Tag polymerase (GeneAmp Kit, 
Perkin Elmer Cetus) as the polymerization agent and each 
cycle consists of the following: denaturation at 94°C X 1 
min; annealing at 55°C X 1 min? and extension at 7 2°C X 1 
min. 

The invention still further provides a kit for determining 
the presence of a DNA segment encoding a protein 
associated with attaching and effacing activity of 
enterohemorrhagic E.coli or a predetermined 
oligonucleotide fragment thereof in a sample comprising 
primers which are capable of amplifying the DNA segment or 
the predetermined oligonucleotide fragment thereof, in a 
polymerase chain reaction to form amplified sequences, 
means for detecting the amplified sequences, and 
directions for its use. 

The present invention also relates to a method of 
detecting verotoxin-producing E .coli using the DNA segment 
of the invention and oligonucleotide fragments thereof. As 
hereinbefore mentioned the present inventors have found 
that a Sall-StuI fragment (C1-C2) comprising the sequences 
as shown in the Sequence Listing as SEQ ID NO: 2 and SEQ ID 
NO: 3, may be used to detect verotoxin-producing E . coli of 
the serogroups 05, 026, 0103, 0111, 0118, 0145, and 0157. 
A 0.5 kb fragment (A3-B2) comprising the sequences as 
shown in the Sequence Listing as SEQ ID NO: 4 and SEQ ID 
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NO: 5 has also been found to be useful for detecting 
verotoxin-producing E . coli serogroup 0157. 

The present inventors have also designed specific 
nucleotide probes for detecting pathogenic verotoxin- 
producing E . coli strains of the serogroup 0157. In 
particular, the synthetic nucleotide probes P1EH and P2EH 
shown in the Sequence Listing as SEQ ID NO: 6 and SEQ ID 
NO: 7, respectively, were found to specifically detect only 
verotoxin-producing E.coli serotypes 0157 :H7 and 0157 :NM. 
Non verotoxin-producing 0157 :NM strains and other 0157 :H 
serotypes were not detected by P1EH and P2EH, making these 
probes ideal for detecting pathogenic 0157 EHEC * 

The synthetic nucleotide probes P1EH and P2EH shown in the 
Sequence Listing as SEQ ID NO: 6 and SEQ ID NO: 7, 
respectively, were also used in PCR and were found to 
specifically detect only verotoxin-producing E.coli 0157 
serogroups . Non verotoxin-producing 0157 serogroups were 
not detected by P1EH and P2EH, making these probes ideal 
for detecting pathogenic 0157 EHEC* 

The nucleotide sequences PIO and P20 shown in the Sequence 
Listing as SEQ ID NO: 8 and SEQ ID NO: 9, respectively, 
based on the 0111 :H8 sequence were found to detect 
verotoxin-producing 0111 :H8 and OHl:NM but not 0111 :H11 
strains indicating that there is further diversity in the 
carboxyl-terminal within the serogroup. 

Accordingly, the present invention provides a purified and 
isolated nucleotide sequence comprising the sequence shown 
in the Sequence Listing as SEQ ID NO: 2; the sequence shown 
in the Sequence Listing as SEQ ID NO: 3; the sequence shown 
in the Sequence Listing as SEQ ID NO: 4, the sequence shown 
in the Sequence Listing as SEQ ID NO: 5; the sequence shown 
in the Sequence Listing as SEQ ID NO: 6, the sequence shown 
in the Sequence Listing as SEQ ID NO: 7; the sequence shown 
in the Sequence Listing as SEQ ID NO: 8; the sequence shown 
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in the Sequence Listing as SEQ ID NO: 9; a sequence having 
substantial homology thereto or a fragment of the 
nucleotide sequence. The location of the sequences of the 
sequence shown in the Sequence Listing as SEQ ID NO: 2, SEQ 
ID NO: 3, SEQ ID NO: 4 , SEQ ID NO: 5, SEQ ID NO: 6 and SEQ ID 
NO:7 on the 0157 :H7 eae gene are shown in Figure 6. 

The invention relates to a method for detecting verotoxin- 
producing E.coli of the serogroups 05, 026, 0103, 0111, 
0118, 0145, and 0157 in a sample, comprising contacting 
the sample with a Sall-StuI fragment (C1-C2) comprising 
the sequence as shown in the Sequence Listing as SEQ ID 
NO: 2 and the sequence as shown in the Sequence Listing as 
SEQ ID NO: 3, under conditions which permit the fragment to 
hybridize with a complementary sequence in the sample to 
form a hybridization product, and assaying for the 
hybridization product. 

The invention relates to a method for detecting verotoxin- 
producing E.coli serogroup 0157 in a sample comprising 
contacting the sample with a 0.5 kb fragment (A3-B2) 
comprising the sequence as shown in the Sequence Listing 
as SEQ ID NO: 4 and the sequence as shown in the Sequence 
Listing as SEQ ID NO: 5, under conditions which permit the 
fragment to hybridize with a complementary sequence in the 
sample to form a hybridization product, and assaying for 
the hybridization product. 

Hybridization conditions which may be used in the method 
of the invention are known in the art and are described 
for example in Sambrook J, Fritch EF, Maniatis T. In: 
Molecular Cloning, A Laboratory Manual, 1989. (Nolan C, 
Ed.), Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY. The hybridization product may be assayed using 
techniques known in the art. The oligonucleotide fragments 
may be labelled with a detectable marker as described 
herein and the hybridization product may be assayed by 
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produced by the detectable marker. 

The invention still further relates to a method for 
detecting pathogenic verotoxin-producing E . coli strains of 
the serogroup 0157 in a sample comprising treating the 
sample with the nucleotide sequ nee P1EH shown in sequence 
the Sequence Listing as SEQ ID NOs6 and the nucleotide 
sequence P2EH5 shown in the Sequence Listing as SEQ ID 
NO: 7 in a PCR reaction, under conditions which permit the 
formation of amplified sequences which terminate with the 
nucleic acid sequence of one of the primers and the 
complementary nucleic acid sequence of the other of the 
primers, and, assaying for the amplified sequences. 

The PCR reaction and suitable conditions permitting 
formation of amplified sequences are described above, 

A kit for detecting verotoxin-producing E.coli of the 
serogroups 05, 026, 0103, 0111, 0118, 0145, and 0157 in a 
sample is provided comprising a Sall-StuI fragment (C1-C2) 
comprising the sequence as shown in the Sequence Listing 
as SEQ ID NO: 2 and the sequence as shown in the Sequence 
Listing as SEQ ID NO: 3, reagents required for 
hybridization of the fragment with a DNA segment encoding 
the attaching and effacing protein of EHEC or an 
oligonucleotide fragment thereof, and directions for its 
use, 

A kit for detecting verotoxin-producing E .coli of the 
serogroup 0157 in a sample is also provided comprising a 
0,5 kb fragment (A3-B2) comprising the sequence as shown 
in the Sequence Listing as SEQ ID NO: 4 and the sequence as 
shown in the Sequence Listing as SEQ ID NO: 5, reagents 
required for hybridization of the fragment with a DNA 
segment encoding the attaching and effacing protein of 
EHEC or an oligonucleotide fragment thereof, and 
directions for its use. 
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pathogenic verotoxin-producing E.coli strains of the 
serogroup 0157 in a sample comprising the primers P1EH and 
P2EH as shown in the Sequence Listing as SEQ ID NO: 6 and 
as SEQ ID NO: 7, respectively, reagents required for the 
amplification in a PCR reaction of sequences which 
terminate with the nucleic acid sequence of one of the 
primers and the complementary nucleic acid sequence of the 
other of the primers, and directions for its use. 

The DNA segment of the present invention having a sequence 
which codes for a protein associated with attaching and 
effacing activity of EHEC, or an oligonucleotide fragment 
of the DNA segment including the nucleotide sequences of 
probes and primers described herein may be incorporated in 
a known manner into a recombinant molecule which ensures 
good expression of the protein or part thereof. In 
general, a recombinant molecule of the invention contains 
the DNA segment or an oligonucleotide fragment thereof of 
the invention and an expression control sequence 
operatively linked to the DNA segment or oligonucleotide 
fragment. The DNA segment of the invention or an 
oligonucleotide fragment thereof, may be incorporated into 
a plasmid vector, for example, pTZ18R. 

The protein associated with attaching and effacing 
activity of EHEC or isoforms or parts thereof, may be 
obtained by expression in a suitable host cell using 
techniques known in the art. Suitable host cells include 
prokaryotic or eukaryotic organisms or cell lines, for 
example, E.coli JM 101 and E.coli LE 392. The protein or 
parts thereof may be prepared by chemical synthesis using 
techniques well known in the chemistry of proteins such as 
solid phase synthesis (Merrifield, 1964, J. Am. Chem. 
Assoc. 85:2149-2154) or synthesis in homogenous solution 
(Houbenweyl, 1987, Methods of Organic Chemistry, ed. E. 
Wansch, Vol. 15 I and II, Thieme, Stuttgart). 



- 21 - 



The proteiS 



m 



Associated with attaching 



2078710 



effacing 



activity of EHEC or isoforms or parts thereof, of the 
invention may be expressed in an avirulent bacterial 
carrier strain such as Salmonella and Shigella strains. 
Accordingly, in a further aspect of the invention an 
avirulent strain of an EHEC is provided comprising an 
avirulent bacterial carrier strain transformed with a 
recombinant molecule of the invention. The avirulent 
strain may provide the basis for a vaccine composition 
which may be useful for effecting immunity against 
diseases caused by EHEC, in particular E.coli serotype 
0157 :H7, The invention therefore also provides a vaccine 
composition comprising a bacterial carrier strain 
transformed with a recombinant molecule of the invention. 
The vaccine composition may be useful in effecting 
immunity against diseases such as hemorrhagic colitis and 
hemolytic uremic syndrome. 

The vaccine compositions can be prepared by per se known 
methods for the preparation of pharmaceutically acceptable 
vaccines which can be administered to patients . The 
vaccine composition may be in an oral or injectable form 
and may include pharmaceutically acceptable vehicles. On 
this basis, the vaccine compositions include, albeit not 
exclusively, solutions of the bacterial carrier strain 
transformed with a recombinant molecule of the invention 
in association with one or more pharmaceutically 
acceptable vehicles or diluents, and contained in buffered 
solutions with a suitable pH and iso-osmotic with the 
physiological fluids. 

The protein associated with attaching and effacing 
activity of EHEC or parts thereof, may be used to prepare 
monoclonal or polyclonal antibodies. Antibodies may be 
prepared which bind a distinct epitope in an unconserved 
region of the protein. Conventional methods can be used to 
prepare the antibodies* As to the details relating to the 
preparation of monoclonal antibodies reference can be made 
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to Goding^Kw., Monoclonal Antibodies: ^^nciples and 

Practice, 2nd Ed., Academic Press, London, 1986, In 
general monoclonal antibodies are prepared by obtaining 
hybridomas derived from immortalizing antibody producing 
cells obtained from a mammal having been immunized with a 
protein, and screening the hybridomas for production of 
antibody which binds the isolated protein* 

The polyclonal or monoclonal antibodies may be used to 
detect a protein associated with attaching and effacing 
activity of EHEC in various biological materials or food 
samples, for example they may be used in an Elisa, 
radioimmunoassay or histochemical tests. Thus, the 
antibodies may be used as diagnostic reagents to quantify 
the amount of protein associated with attaching and 
effacing activity in a sample. 

The monoclonal and polyclonal antibodies may be labelled 
with a detectable marker including various enzymes, 
fluorescent materials, luminescent materials and 
radioactive materials . Examples of suitable enzymes 
include horseradish peroxidase, biotin, alkaline 
phosphatase , 0-galactosidase , or acetylchol inester ase ; 
examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanate, 
rhodamine , dichlorotriazinylamine fluorescein , dansyl 
chloride or phycoerythrin; an example of a luminescent 
material includes luminol; and examples of suitable 
radioactive material include I 125 , I 131 or tritium. 

The invention will be more fully understood by reference 
to the following examples. However, the examples are 
merely intended to illustrate embodiments of the invention 
and are not to be construed to limit the scope of the 
invention. 



EXAMPLES 

Example 1 



Cloning an quencing of eae gene homol from EHEC 

strain 0157 

The following materials and methods were used in the 
investigations illustrated in the examples 

Bacteriological media 

E-coli strains were grown in L-broth and L-agar (Lech, K. 
and Brent, R. (1989) in Current Protocols in Molecular 
Biology (Ausubel, F.M. et al. Eds.) John Wiley & Sons, New 
York) supplemented when necessary with lOOug/ml 
carbenicillin (Sigma Chemical Co,, St. Louis, MO). For 
production of single stranded DNA, strains were grown in 
2xYT broth (Sambrook, J., Fritch, E.F., and Maniatis, T. 
(1989) in Molecular Cloning, A Laboratory Manual (Nolan, 
C, Ed.) Cold Spring Harbor Laboratory Press, New York). 
Bacteriophage lambda was grown in lambda broth (Lech, K. 
and Brent, R. (1989) in Current Protocols in Molecular 
Biology (Ausubel, F.M. et al. Eds.) John Wiley & Sons, New 
York) . 

Bacterial strains and vectors 

E-coli strains E2348/69 (EPEC), serotype 0127:H6 and CL8 
(EHEC), serotype 0157 :H7 were obtained from 
Dr. M. Karmali, The Hospital for Sick Children, Toronto, 
Canada, and have been previously described (Levine, M.M. , 
Nataro, J. P., Karch, H. , Baldini, M.M., Kaper, J.B., 
Black, R.E., Clements, M.L. and O'Brien, A.D. (1985) J. 
Infect. Dis. 152, 550-559; Karmali, M.A., Petric , M. , Lim, 
C, Fleming, P.C., Arbus, G.S. and Lior, H. (1985) J. 
Infect. Dis. 151, 775-882; Sherman, P., Soni, R. and 
Karmali, M.A. (1988) Infect. Immun. 56, 756-761). E.coli 
JM101 (Messing, J. (1979) Recomb. DNA Tech. Bull. 2,43-48) 
was used as the host for plasmid pTZ18R (Mead, D.A., 
Szczesna-Skorupa, E. , and Kemper, B. (1986) Protein 
Engineering. 1,67-74) and E-coli LE392 (Borck, K., Beggs, 
J.D., Brammar, W.J., Hopkins, A.S., and Murray, N.E. 
(1976) Mol. Gen. Genet. 146, 199-207; Murray, 
N.E. , Brammar, W.J., and Murray, K. (1977) Mol. Gen. Genet. 
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150,53-61) was used for propagation of bacteriophage 
lambda GEM-11 (Promega, Madison, WI). Bacteriophage M13K07 
(Rokeach, L., Haselby, J. and Ahoch, S. (1988) Proc. Nat. 
Acad. Sci. USA . 85, 4832-4836) was used as a help r to 
produce single stranded DNA from pTZ18R and recombinant 
derivatives . 

DNA manipulation and sequencing 

Plasmid DNA was prepared by the method of Birnboim and 
Doly (Birnboim, H.C. and Doly, J. (1979) Nucl. Acids Res. 
7, 1513-1523). Bacterial genomic DNA was extracted using 
hexadecyltrimethyl ammonium bromide precipitation (Murray, 
M.G., and Thompson, W.F. (1980) Nucl. Acids Res. 8,4321- 
4325). Genomic DNA isolated from the EHEC strain CL8 was 
partially digested with Sau3A and fragments were size- 
fractionated by sucrose density gradient centrifugation 
(Sambrook, J., Fritch, E.F., and Maniatis, T. (1989) in 
Molecular Cloning, A Laboratory Manual (Nolan, C, Ed.) 
Cold Spring Harbor Laboratory Press, New York). The 
desired fragments (14-23kb) were isolated and ligated to 
dephosphorylated lambda Gem-11 BamHlarms (Promega, 
Wisconsin, MA] and packaged using the Packagene System 
extracts (Promega, Wisconsin, MA). Bacteriophage particles 
were propagated in E.coli LE392 and plated for plague 
isolation. 

The 1 Kb Stul-Sa21 fragment described by Jerse et al 
(Jerse, A., Yu, J., Tall, B. , and Kaper, J. (1990) Proc. 
Natl. Acad. Sci. USA 87, 7839-7843) was used as a probe 
for the eae gene. It was produced by amplification of the 
central region of the eae gene of strain E2348/69 by the 
polymerase chain reaction (Sambrook, J., Fritch, E.F., and 
Maniatis, T. (1989) in Molecular Cloning, A Laboratory 
Manual (Nolan, C, Ed.) Cold Spring Harbor Laboratory 
Press, New York) using Tag polymerase (Perkin Cetus Elmer 
Corp., Norwalk, CT) . Primers flanking the Stul and Sail 
sites were designed using the nucleotide sequence of the 
eae gene of E2348/69 deposited in Genbank by Jerse et al. 
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(Jerse, A. f Yu, J., Tall, B., and Kaper, J, (1990) Proc. 
Natl. Acad. Sci. USA 87, 7839-7843). The upstream primer 
was ATGGAATTCTCGTCACAGTTGCAGGCCTGGT (2241-2263 of the eae 
sequence) and the downstream primer was 
ATGG AAT TC CGAAGTC TTATC A - GCCGTAAAGT (3350-3328 of the eae 
sequence) with EcoRl recognition sequences included at the 
5 'end of each primer. The fragment was labelled with a 32 P 
dATP using a random priming kit (Boehringer Mannheim) .This 
probe was used in plaque hybridizations to further 
identify clones containing the putative eae sequence. 

Fragments which hybridized with the probe were subcloned 
in pTZ18R (Sambrook J, Fritch EF, Maniatis T. In: 
Molecular cloning, A Laboratory Manual, 1989* (Nolan C, 
Ed.), Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY.). Nested deletions were constructed by 
sequential digestion with exonuclease III and SI nuclease 
using the Erase-a-base System (Promega, Madison, WI). 
Single stranded templates were prepared from fragments 
cloned in pT218R by superinfection with bacteriophage 
M13K07. DNA sequencing was performed using the Sanger 
dideoxy chain termination method (Sanger, F . , Niklen, S. 
and Coulson, A. R. (1977) Proc . Nat .Acad. Sci. USA. 74, 5463- 
5468) with Sequenase Version 2.0 (United States 
Biochemicals, Cleveland, Ohio). The entire sequence of a 
3.5Kb region including the eae gene was determined using 
oligonucleotide primers where necessary. Sequence analysis 
was performed using the Wisconsin Genetics Computer Group 
sequence analysis software package version 6.0 (Devereux, 
J., Haeberli, P., and Smithies, O. (1984) Nucleic Acids 
Res. 12, 387-395). 

Nucleotide sequence and accession number 

The DNA sequence data was submitted to the EMBL database 
and assigned the accession number X60439. 

Of the plaques screened, several hybridized to the EPEC 
eae probe on primary screening and one designated AGEM-11 



D4-4 with ^^.nsert of 14Kb, was chosen f °^^ rt 2^7^ 1 ^ c 5 r g 
The fragm^^fc carrying the EHEC eae ^Mologue were 
identified by hybridization to the 1Kb EPEC probe. The 
entire predicted structural gene was sequenced including 
205 bp upstream and 118 bp downstream* Figure 1 shows the 
DNA sequence of the EHEC eae gene and the deduced protein 
sequence starting at the start codon at 206 bp. The end of 
the open reading frame is denoted by an asterisk. The 
sequence was compared with that of the EPEC eae gene. 
The EPEC and EHEC sequences are virtually identical for 
the first 2200 bp of the structural gene and for 
approximately 200 bp upstream of the start site. However, 
there is considerable divergence in the last 800 bp where 
the similarity is only 59% (Figure 2). 

The amino acid sequence alignment of the predicted EPEC 
and EHEC proteins is shown in Figure 3. The positions of 
the terminal cysteine residues are denoted by asterisks . 
The N- terminal amino acids are highly conserved for the 
first 685 amino acids while for the C- terminal amino 
acids, only short stretches can be aligned. There is 
conservation of cysteine residues at positions 859 and 9 33 
of the alignment. 

Jerse et al. first noted the similarity between the 
central regions of eae gene of EPEC and the inv gene of 
Yersinia pseudotuberculosis (Jerse AE, Yu J, Tall BD, 
Kaper, J.B. Proc Natl Acad Sci USA 1990; 87*7839-43). The 
invasin protein, a product of the inv gene is an outer 
membrane protein of Y. pseudotuberculosis which has been 
extensively studied by Isberg et al. (Isberg, R.R., 
Voorhis, D.L. and Falkow, S. (1987) Cell. 50,769-778). 
When expressed in E.coli K12, the inv gene is necessary 
and sufficient for invasion of HEp-2 cells in tissue 
culture (Isberg, R.R., and Falkow, S. (1985) Nature. 317, 
262-264). The invasin protein has been divided into three 
domains on the basis of mutational analysis and the study 
of fusion proteins (Isberg, R.R* (1989) Mol. Microbiol. 
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3,1449-1453). The amino terminus is importer for export 
beyond the inner membrane while the central region is 
critical for stable localization in the outer membrane. 
The last 192 residues at the carboxyl terminus have been 
shown to be responsible for binding to £1 integrin 
receptors on the eukaryotic cell surface (Leong, J.M., 
Foumier, S., and Isberg, R.R. (1990) EMBO. Jnl. 9,1979- 
1989)* This domain also carries the epitopes recognized by 
neutralizing monoclonal antibodies. Examination of the two 
sequences published for the inv genes of Y. enterocolitlca 
(Miller, V.L. and Falkow, S. (1988) Infect. Immun. 
56,1242-1248) and the Y. pseudotuberculosis 
(Isberg RR, Voorhis DL, and Falkow S. Cell 1987; 50:769- 
78) showed that there was similarity to the eae genes in 
15 the central one third of the sequence. The predicted 

protein products of the two inv genes also possess a 
subterminal cysteine residue with a second approximately 
80 amino acids upstream. This suggests that the carboxyl 
termini of all four proteins contain a disulfide loop 
which may be necessary for biologic activity. 



On the basis of the similarities between the inv and eae 
genes, the carboxyl termini of the eae proteins are 
predicted to be the receptor binding domains. It is likely 

25 that the dissimilarities in this region of the molecules 

are responsible for antigenic variation and possibly for 
differences in receptor binding specificity. Sherman et al 
(Sherman, P., Cockerill III, F., Soni, R. , and Brunton, J. 
(1991) Infect. Immun. 59,890-899) showed that antiserum 

30 raised against a 94 kilodalton outer membrane protein of 

an 0157 :H7 strain inhibited adherence and AE lesion 
formation by the homologous EHEC strain but had no effect 
on the adherence of the EPEC strain E2348/69. Differences 
in receptor specificity could explain the differences 

35 observed in intestinal colonization by EPEC and EHEC 

strains. EPEC colonize the entire intestine in animal 
models and the upper small bowel of humans. In 
contrast, EHEC strains colonize the cecum and large 
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intestine animal models while in hunSms they cause 
intense proximal colonic inflammation, characteristic of 
hemorrhagic colitis (Knutton, S., Baldwin, T., Williams, 
P.H., and McNeish, A. S. (1989) Infect. Immun. 57, 1290- 
1298; Tzipori, S., Wachsmuth, I., Smithers, J. , and 
Jackson, C, (1988) Gastroenterology 94, 590-597; 
Tzipori, S., Wachsmuth, I.K., Chapman, C, Birner, R. , 
Brittingham, J., Jackson, C, and Hogg, J. (1986) J. 
Infect. Dis. 154, 712-716; Riley, L. (1987) Annu. Rev. 
Microbiol. 41, 383-407; Karmali, M. (1989) Clin. 
Microbiol. Rev. 2, 15-38). Recent reports suggest that 
adhesions encoded by the high molecular weight plasmids of 
EPEC and EHEC strains are important in determining the 
specificity of adherence to tissue culture cells 
(Baldini, M.M. , Kaper, J.B. , Levine, M.M. , Candy, D.C.A., 
and Moon, H.W. (1983) J. Ped* Gastroenterol. Nutr. 2,534- 
538; Karch, H. , Hessemann, J. , Laufs, R. , O'Brien, A.D., 
Tacket, CO., and Levine, M.M. (1987) Infect. Immun. 
55,455-461; Cantey, J.R., and Moseley, S.L. (1991) Infect. 
Immun. 59, 3924-3929; Jerse, A.E., Gicquelais, K.G. , and 
Kaper, J.B. (1991) Inf ect . Immun . 59, 3869-3875). It 
remains to be seen whether differences in the C-terminal 
regions of the eae gene products of different EPEC and 
EHEC strains might result in alterations of receptor 
specificity. 

EXAMPLE 2 

Serotype Distribution of the EHEC eae Gene Among 
Verotoxin-Producing E.coli . 

Probes based on the EHEC eae gene were used to study the 
serotype distribution of the eae homologue in a collection 
of verotoxin-producing E.coli isolated from human animal 
and food sources using colony hybridization. Colony 
hybridizations were carried out using standard techniques 
under highly stringent conditions (Sambrook J, Fritch EF, 
Maniatis T . In: Molecular cloning, A laboratory 
Manual, 1989. (Nolan C, Ed.)# Cold Spring Harbor Laboratory 
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by PCR amplification using strain 0157 sH7 or strain 
0111 :H8. The primers used to amplify the probes are set 
forth in Table 1 and are also shown in the Sequence 
5 Listing as SEQ ID NOS: 2 to 9 . DNA fragments used as 

probes were labeled with [or 32 P] dATP using the Random 
Primed DNA Labeling Kit (Boehringer Mannheim). 
Oligonucleotide probes were end-labeled with [y 32 P] dATP 
using T4 polynucleotide kinase (Sambrook J, Fritch EF, 
10 Maniatis T. In: Molecular cloning, A laboratory 

Manual ,1989. (Nolan C, Ed.), Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY.). 

DNA for sequencing was made by PCR amplification using 
15 primers based on sequences within the 0157 :H7 eae gene. 

Sequencing was carried out using the ds DNA Cycle 
Sequencing System (BRL Life Technologies) according to the 
manufacturer • s instructions . 

20 The results of the colony blot hybridizations using probes 

based on the 0157 sH7 EHEC eae gene are shown in Table 2 
and Figure 4. The results show that whereas the central 
1 kb fragment C1-C2 hybridized to a variety of the 0 
serogroups of VTEC tested, a 0.5 kb fragment A3-B2 in the 

25 carboxyl-terminal region only hybridized to 0157 strains. 

Eae homologues among VTEC were detected in serogroups 
0157, 0111, 026, 05, 0103, 0145 and 0118 but not 0113 and 
0156 . A 0 . 5 kb fragment (A3-B2 ) derived from the less 
conserved 3 ' region hybridized with serogroups 0157 and 

30 occasional 0111 strains. However, a more specific 0.45 kb 

probe ( P1EH-P2EH) only hybridized with VTEC of serogroup 
0157 (Figure 5). 

Example 3 

35 Oligonucleotide primers specific to the EHEC eae gene were 

used in polymerase chain reaction assays to test 150 VTEC 
of diverse serogroups which were isolated from food, 
animal and human specimens. In particular, previously 
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characterized and serotyped isolates of verotoxin- 
producing E » coli and enteropathogenic E.coli strains 
derived from human, food and animal sources were 
contributed by A.Borczyk, C. Gyles and M.Karmali, The 
Hospital for Sick Children, Toronto, Canada- Several 
colonies were selected, resuspended in 50|il PCR buffer 
(XI) (Perkin Elmer Cetus) and boiled for 10 mins prior to 
use as template DNA in PCR amplifications. The PCR 
reaction conditions were as follows: 50*il reactions were 
set up using ImM deoxynucleotide triphosphates, IX PCR 
Buffer, 1 unit Tag polymerase (GeneAmp Kit, Perkin Elmer 
Cetus), 20 pmol of each primer and 1^1 sample preparation. 
Each cycle (X40) consisted of the following: 

denaturation @ 94°C X 1 min 

annealing 8 55°C X 1 min 

extension @ 72°C X 1 min. 



Primers based on the 0157 :H7 sequence and the 0111 
sequence, (primers PIO and P20) were used to amplify DNA 
from the various serogroups. The primers are shown in 
Table 1. The location of the A3, B2, P1EH, and P2EH 
primers on the 0157:h7 eae gene are shown in Figure 6. 

Colony blot hybridization using [y 32 P] dATP oligonucleotide 
probes was used to confirm the specificity of the primers. 
Colony blot hybridizations were performed as described in 
Example 2 . 

Verotoxin-producing E.coli of serogroups 0157 and 0111 
strains were detected by PCR with primers A3-B2, giving a 
0.5 kb fragment. Both primer sets A3-B2 and P1EH-P2EH did 
not detect non verotoxin-producing E . coli of the 
serogroups 0157 and 0111. Primer set P1EH-P2EH 
specifically detected only 0157 verotoxin positive strains 
giving a 0.45 kb fragment (Table 3, Figure 7). Figure 7 
shows the amplification products of VTEC and EPEC using 
specific primer sets A3-B2 (0.5 kb) and PIEH-P2EH (0.45kb) 
and in particular the following: 
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LANES 1,20:MW STANDARDS 

LANE 3:0157:H7+ 

LANE 5:0111:H8+ 

LANE 7:0111:NM+ 

A3-B2 PRIMERS 

LANE 8:05:NM - 

LANE 9:0127:H6- 

LANE 13:026:H11- 

LANES 10,11:NEG CONTROLS 

LANE 14:0157:H7+ 

LANE 15:0111:H8- 

P1EH-P2EH PRIMERS 

LANE 16:0111sNM- 

LANE 19:NEG CONTROL 

Colony blot hybridization using [y 32 P] dATP oligonucleotide 
probes confirmed the specificity of primers P1EH-P2EH data 
obtained from the PCR. Only vero toxin-producing 0157 
serogroups hybridized with P1EH and P2EH (Figure 8). 

Attempts to amplify DNA from serogroups 05, 026 and 0118 
using primers based on the 0157 :H7 sequence were 
unsuccessful. The only serogroup which gave a PCR 
amplification product was an 0111 :H8 strain using primers 
2214 and 3011. Comparisons of the C-terminal ends of the 
eae genes from an 0157:H7, 0111 :H8, an EPEC strain and the 
Y. pseudotuberculosis inv gene are shown in Figure 9. 

Based on the 0111 sequence, primers P10 and P20 were 
designed which gave a PCR product for 0111 :NM and 0111 :H8 
verotoxin-producing (VT+) strains but not 0111 :H11 
indicating that there are differences among (VT+) 0111 
serotypes. However, non vero toxin-producing (VT- ) 0111 :H2 
and 0111 :H12 strains were negative. 

The present invention has been described in detail and 
with particular reference to the preferred embodiments; 
however, it will be understood by one having ordinary 
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mf 

skill in tn^art that changes can be made thereto without 
departing from the spirit and scope thereof. 

Forming part of the present disclosure are the appended 
sequence listings. 
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PRIMERS 

CI 5'ATG GAA TTC TCG TCA CAG TOG CAG GCC TGG T 3 1 sense 

C2 5'ATG GAA TTC CGA AGT CTT ATC AGC CGT AAA GT 3 1 ant i sense 

A3 5'ATG GAA TTC GGA TGT TCA ACG GTA AGT CT 3' sense 

B2 5'ATG GAA TTC ACA ATA CCG TCT GTG TGG AT 3' antisense 

2214 5'ATG GGA TCC CCG ATA TCG CAG TAG CAA ATG GTA AGG AT 3 ' sense 

3011 5'ATG AAG CTT GAA TTC TCT ACA CAA ACC GCA T 3* antisense 

P1EH 5' AAG CGA CTG AGG TCA CT 3' sense 

P2EH 5 'ACG CTG CTC ACT AGA TGT 3* antisense 

PIO* 5' AGA CCT AGG TTA CAT TT.. 3 • sense 
P20* 5* TAT TTT ATC AGC TTC AGT 3' antisense 
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TABLE 2 



SEROQROUP DISTRIBUTION OF EAE-HOMOLOGUES 
AMONG VEROTOXIN-PRODUCING E.COU 

PROBE 

TRAL y REGION 



SEROGROUP 

01 57 

0111 

026 

05 

01 03 

0145 

0118 

0113 

01 56 

0127:H6 
(E2348) 

015:H- 
(RDEC1) 



C1-C2 
(1.0kb) 

67/67 
11 / 11 
12/12 
7/ 7 
4/ 4 
5/ 5 

1 / V 
0/ 5 

0/ 4 

1 / 1 

1/1 



A3-B2 



18 

2 
0 
0 
0 
0 
0 
0 
0 
0 



18 
10* 
7 
6 
3 

13 
1 
4 
4 
1 



0/ 1 



P1EH-P2EH 
(0.45 kb) 

7/ 7 

0/ S 

0/ 2 

0/ 1 

0/ 1 

0/ 2 
0/ 1 

0/ 1 
0/ 1 
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TABLE 3 



DETECTION OF 0157-.H7 ££OJi BY PGR 
USING SPECIFIC PRIMERS 

PRIIflER SET 
A3-B2 P1EH-P2EH 
SEROGROUPS #Pos / #T@stsd #P©s / #Tested 



01 57 (VT+) 
(H7.NM) 


18/18 


18 / 18 


01 57 (VT-) 


0/ 8 


0/ 8 


0111 (VT+) 
(NM,H8,H11) 


17 / 18 


0/12 


0111 (VT-) 
(H2,H12) 


0/ 2 


0/ 2 


026 


0/ 6 


0/ 2 


VTEC* 


0/27 


0/16 


ETEC** 


0/ 5 


0/ 3 


EPEC*** 


0/ 4 


0/ 1 


ElEC 
(0136) 


0/ 1 


0/ 1 



* VTEC (01, 02, 05, 07, 016, 018. 022, 036. 040. 042, 091, O103. 0113. 
01 17, 01 18, 0121, 0145, OR, OT) 

** ETEC (O6, 0115, 0148, 0153) 

***EPEC (015, 0119, 0126, 0127) 
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SEQUENCE LISTING 

1. GENERAL INFORMATION 

(a) APPLICANT : Joyce de Azavedo, James Brunton 
and Marie Louie 

(b) TITLE OF INVENTION: Attaching and Effacing 
Protein of Enterohemorrhagic E .coli 

(C) NUMBER OF SEQUENCES: 9 

2. CORRESPONDENCE ADDRESS: 

(a) ADDRESSEE: Linda M. Kurdydyk, 
Bereskin & Parr 

(b) STREET: Box 401, 40 King Street West 

(c) CITY: Toronto, Ontario 

(d) COUNTRY: Canada 

(e) ZIP CODE: M5H 3Y2 

3. COMPUTER READABLE FORM: 

(a) MEDIUM TYPE: Floppy disk 

(b) COMPUTER: IBM PC compatible 

(c) OPERATING SYSTEM: PC-DOS /MS-DOS 

(d) SOFTWARE: DOS Text File 

4. CURRENT APPLICATION DATA: 

(a) APPLICATION NUMBER: Unknown 

(b) FILING DATE: Unknown 

(c) CLASSIFICATION: Unknown 

5. ATTORNEY/AGENT INFORMATION: 

(a) NAME: Linda M, Kurdydyk 

(b) REGISTRATION NUMBER: 34,971 

(c) REFERENCE NUMBER/DOCKET NUMBER: 3153-057/LMK 

6. TELECOMMUNICATION INFORMATION: 

(a) TELEPHONE: (416) 364-7311 

(b) TELECOPY: (416) 361-1398 

7. INFORMATION FOR SEQ ID NO:l: 

(a) SEQUENCE CHARACTERISTICS: 

(i) LENGTH: 3131 base pairs 

(ii) TYPE: nucleic acid 

(iii) STRANDEDNESS: double 

(iv) TOPOLOGY: linear 

(b) MOLECULAR TYPE: DNA (genomic) 

(c) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



TCGAGAATGAAATAGAAGTCGTTGTTAAGTCAATGGAAAACCTGTATTTGGTATTACATA 

: + + + + + + 60 

AGCTCTTACTTTATCTTCAGCAACAATTCAGTTACCTTTTGGACATAAACCATAATGTAT 
SRMK*KSLLSQWKTC IWYY I 
RE*NRSRC*VNGKPVFGIT*- 
ENE I EVVVKSMENLYLVLHN- 
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ATCAGGGAATAACATTAGAAAACGAACATATGAAAATAGAGGAAATCAGTTCAAGCGACA 
61 + + + + + .+ 120 

TAGTCCCTTATTGTAATCTTTTGCTTGTATACTTTTATCTCCTTTAGTCAAGTTCGCTGT 
a IRE*H*KTNI*K*RKSVQAT 
b SGNNIRKRTYENRGNQFKRQ- 
c QGITLENEHMKIEEISSSDN- 



ATAAACATTATTACGCCGGAAGATAAAATCCGATCTATTAATATAATTTATTTCTCATTC 
121 + + + + + + 180 

TATTTGTAATAATGCGGCCTTCTATTTTAGGCTAGATAATTATATTAAATAAAGAGTAAG 
a INIITPEDKIRSINIIYFSF 
b *TLLRRKIKSDLLI*FISHS- 
c KHYYAGR * N P I Y*YNLFL I L- 



TAACTCATTGTGGTGGAGCCA'JAACATGATTACTCATGGTTGTTATACCCGGACCCGGCA 

181 + + + + + + 240 

ATTGAGTAACACCACCTCGGTATTGTACTAATGAGTACCAACAATATGGGCCTGGGCCGT 

a *LIVVEP*HDYSWLLYPDPA 

b NSLWWSHNMITHGCYTRTRH- 

C THCGGAI T * L L M V V I PGPG T - 



CAAGCATAAGCTAAAAAAAACATTGATTATGCTTAGTGCTGGTTTAGGATTGTTTTTTTA 
241 + + + + + + 300 

GTTCGTATTCGATTTTTTTTGTAACTAATACGAATCACGACCAAATCCTAACAAAAAAAT 
a QA*AKKNIDYA*CWFRIVFL 
b KHKIiKKTLIMLSAGLGLFFY- 
c SIS*KKH*LCLVLV*DCFFM- 



TGTTAATCAGAATTCATTTGCAAATGGTGAAAATTATTTTAAATTGGGTTCGGATTCAAA 
301 + + + + + + 360 

ACAATTAGTCTTAAGTAAACGTTTACCACTTTTAATAAAATTTAACCCAAGCCTAAGTTT 
a C *SEFICKW*KLF* IGFGFK 
b VNQNSFANGENYFKLGSDSK- 
c LIR IHLQMVK I ILNWVR I QN- 



ACTGTTAACTCATGATAGCTATCAGAATCGCCTTTTTTATACGTTGAAAACTGGTGAAAC 

361 + + + + + 4- 420 

TGACAATTGAGTACTATCGATAGTCTTAGCGGAAAAAATATGCAACTTTTGACCACTTTG 

a TVNS**LSESPFLYVENW*N 

b LLTHDSYQNRLFYTLKTGET- 

c C*LMIAIRIAFFIR*KLVKL- 
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TGTTGCCGATCTTTCTAAATCGCAAGATATTAATTTATCGACGATTTGGTCGTTGAATAA 
421 + + + + + + 480 

ACAACGGCTAGAAAGATTTAGCGTTCTATAATTAAATAGCTGCTAAACCAGCAACTTATT 
a CCRSF*IARY*FIDDLVVE* 
b VADLSKSQDlNLSTIWSIiNK- 
c LPIFLNRKIIjIYRRFGR* I S- 



GCATTTATACAGTTCTGAAAGCGAAATGATGAAGGCCGCGCCTGGTCAGCAGATCATTTT 
481 + + + + + + 540 

CGTAAATATGTCAAGACTTTCGCTTTACTACTTCCGGCGCGGACCAGTCGTCTAGTAAAA 
a AFIQF*KRNDEGRAWSADHF 
b HLYSSESE-MMKAAPGQQIIL- 
c IYTVLKAK* *RPRLVSRSFC- 



GCCACTCAAAAAACTTCCCTTTGAATACAGTGCACTACCACTTTTAGGTTCGGCACCTCT 
541 + + + + + + 600 

CGGTGAGTTTTTTGAAGGGAAACTTATGTCACGTGATGGTGAAAATCCAAGCCGTGGAGA 
a ATQKTSL* IQCTTTFRFGTS 
b PLKKLPFEYSALPLLGSAPL- 
C HSKNFPLNTVHYHF *VRHLL- 



TGTTGCTGCAGGTGGTGTTGCTGGTCACACGAATAAACTGACTAAAATGTCCCCGGACGT 

601 + + + + ■+ + 660 

ACAACGACGTCCACCACAACGACCAGTGTGCTTATTTGACTGATTTTACAGGGGCCTGCA 

a CCCRWCCWSHE*TD*NVPGR 

b VAAGGVAGHTNKLTKMSPDV- 

c LLQVVLLVTR I N * L K C P R T * - 



GACCAAAAGCAACATGACCGATGACAAGGCATTAAATTATGCGGCACAACAGGCGGCGAG 
661 + + + + + +.720 

CTGGTTTTCGTTGTACTGGCTACTGTTCCGTAATTTAATACGCCGTGTTGTCCGCCGCTC 
a DQKQHDR*QGIKLCGTTGGE 
b TKSNMTDDKALNYAAQQAAS- 
c PKAT*PMTRH* IMRHNRRRV- 



TCTCGGTAGCCAGCTTCAGTCGCGATCTCTGAACGGCGATTACGCGAAAGATACCGCTCT 
721 + + + + + + 780 

AGAGCCATCGGTCGAAGTCAGCGCTAGAGACTTGCCGCTAATGCGCTTTCTATGGCGAGA 
a SR* PASVAISERRLRERYRS 
b LGSQLQSRSLNGDYAKDTAIi- 
C SVAS FSRDL * TA I TRK I P LL- 
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TGGTATCGCTGGTAACCAGGCTTCGTCACAGTTGCAGGCCTGGTTACAACATTATGGAAC 
781 + + + +- + + 840 

ACCATAGCGACCATTGGTCCGAAGCAGTGTCAACGTCCGGACCAATGTTGTAATACCTTG 
a WYRW*PGFVTVAGLVTTLWN 
b GIAGNQASSQLQAWLQHYGT- 
c VSLVTRLRHSCRPGYNIMER- 



GGCAGAGGTTAATCTGCAGAGTGGTAATAACTTTGACGGTAGTTCACTGGACTTCTTATT 
841 + + + + + + 900 

CCGTCTCCAATTAGACGTCTCACCATTATTGAAACTGCCATCAAGTGACCTGAAGAATAA 
a GRG*S AEW* * L * R * FTGLLI 
b AEVNLQSGNNFDGSSLDFLL- 
c QRLICRVV ITLTVVHWTSYY- 



ACCGTTCTATGATTCCGAAAAAATGCTGGCATTTGGTCAGGTCGGAGCGCGTTACATTGA 
901 + + + + + + 960 

TGGCAAGATACTAAGGCTTTTTTACGACCGTAAACCAGTCCAGCCTCGCGCAATGTAACT 
a TVL*FRKNAGIWSGRSALH* 
b PFYDSEKMLAF GQVGARYID- 

C RSMIPKKCWHLVRS ERVTLT- 



CTCCCGCTTTACGGCAAATTTAGGTGCGGGTCAGCGTTTTTTCCTTCCTGCAAACATGTT 
961 + + + + + + 1020 

GAGGGCGAAATGCCGTTTAAATCCACGCCCAGTCGCAAAAAAGGAAGGACGTTTGTACAA 
a LPIiYGKFRCGSAFFPSCKHV 
b SRFTANLGAGQRFFLPANML- 
C PALRQI*VRVSVFSFLQTCW- 



GGGCTATAACGTCTTCATTGATCAGGATTTTTCTGGTGATAATACCCGTTTAGGTATTGG 
1021 + + + + + + 1080 

CCCGATATTGCAGAAGTAACTAGTCCTAAAAAGACCACTATTATGGGCAAATCCATAACC 
a GL*RLH*SGFFW**YPFRYW 
b G YNVFIDQDFSGDNTRLGIG- 

C AI TSSLIRIFLVIIPV*VLV- 



TGGCGAATACTGGCGAGACTATTTCAAAAGTAGCGTTAACGGCTATTTCCGCATGAGCGG 
1081 + + + + + + 1140 

ACCGCTTATGACCGCTCTGATAAAGTTTTCATCGCAATTGCCGATAAAGGCGTACTCGCC 
a WRILARLFQK *R*RLFPHER 
b GEYWRDYFKSSVNGYFRMSG- 
c A NTGE T I SKVALTA I SA * A A - 
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ctggcatgagtcatacaaMKgaaagactatgatgagcgcccagcaaatggcttcgatat 

1141 + + + + + + 1200 

gaccgtactcagtatgttattctttctgatactactcgcgggtcgtttaccgaagctata 
a la*viq*erl**apskwlry 

b WHESYNKKDYDERPANGFDI- 
c GMSHTIRKTMMSAQQMASIS- 



CCGTTTTAATGGCTATCTACCGTCATATCCGGCATTAGGCGCCAAGCTGATATATGAGCA 
1201 + + + + + + 1260 

GGCAAAATTACCGATAGATGGCAGTATAGGCCGTAATCCGCGGTTCGACTATATACTCGT 
a PF*WI*STVISGIRRQADI*A 
b RFNGYLPSYPALGAKLIYEQ- 
c V L M A I Y R H I RH * AP S * YM S S- 



GTATTATGGTGATAATGTTGCTTTGTTTAATTCTGATAAGCTGCAGTCGAATCCTGGTGC 

1261 + + + + + + 1320 

CATAATACCACTATTACAACGAAACAAATTAAGACTATTCGACGTCAGCTTAGGACCACG 
a VLW**CCFV*F**AAVESWC 
b YYGDNVALFNSDKLQSNPGA- 
c IMVIMLLCLILISCSRILVR- 



GGCGACCGTTGGTGTAAACTATACTCCGATTCCTCTGGTGACGATGGGGATCGATTACCG 
1321 + + + + + + 1380 

CCGCTGGCAACCACATTTGATATGAGGCTAAGGAGACCACTGCTACCCCTAGCTAATGGC 
a GDRWCKLYSDSSGDDGDRLP 
b ATVGVNYTPIPLVTMG IDYR- 

c RPLV*TILRFLW*RWG SITV- 



TCATGGTACGGGTAATGAAAATGATCTCCTTTACTCAATGCAGTTCCGTTATCAGTTTGA 
1381 + + + *■ + + 1440 

AGTACCATGCCCATTACTTTTACTAGAGGAAATGAGTTACGTCAAGGCAATAGTCAAACT 
a SWYG**K*SPLLNAVPLSV* 
b HGTGNENDLLYSMQFRYQFD- 
c MVRVMKMISFTQCSSVISLI- 



TAAATCGTGGTCTCAGCAAATTGAACCACAGTATGTTAACGAGTTAAGAACATTATCAGG 
1441 + + + + + + 1500 

ATTTAGCACCAGAGTCGTTTAACTTGGTGTCATACAATTGCTCAATTCTTGTAATAGTCC 
a *IVVSAN*TTVC*RVKNIIR 
b KSWSQQIEPQYVNELRTLSG- 
C NRGLSKLNHSMLTS* EHYQA- 
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CAGCCGTTACGATCTGGTTCAGCGTAATAACAATATTATTCTGGAGTACAA3RAGCAGGA 
1501 + + + + + + 1560 

GTCGGCAATGCTAGACCAAGTCGCATTATTGTTATAATAAGACCTCATGTTCTTCGTCCT 
a QPLRSGSA **QYYSGVQEAG 
b SRYDLVQRNNNIILEYKKQD- 
C AVTIWFSVITILFWSTRSRI- 



TATTCTTTCTCTGAATATTCCGCATGATATTAATGGTACTGAACACAGTACGCAGAAGAT 
1561 + + + + 4- + 1620 

ATAAGAAAGAGACTTATAAGGCGTACTATAATTACCATGACTTGTGTCATGCGTCTTCTA 
a YS 'FSEYSA *Y*WY*TQYAED 
b ILSLNIPHDINGTEHSTQKI- 
C F F L * IFRMILMVLNTVRRRF- 



TCAGTTGATCGTTAAGAGCAAATACGGTCTGGATCGTATCGTCTGGGATGATAGTGCATT 
1621 + + 1- + + + 1680 

AGTCAACTAGCAATTCTCGTTTATGCCAGACCTAGCATAGCAGACCCTACTATCACGTAA 
a SVDR*EQIRSG SYRLG**CI 
b QLIVKSKYGLDRIVWDDSAL- 
c S*SLRANTVWIVSSGMIVHY- 



ACGCAGTCAGGGCGGTCAGATTCAGCATAGCGGAAGCCAAAGCGCACAAGACTACCAGGC 
1681 + + + + + + 1740 

TGCGTCAGTCCCGCCAGTCTAAGTCGTATCGCCTTCGGTTTCGCGTGTTCTGATGGTCCG 
a TQSGRSDSA*RKPKRTRI*PG 
b RSQGGQIQHSGSQSAQDYQA- 
C AVRAVRF S IAEAKAHKTTRL- 



TATTTTGCCTGCTTATGTGCAAGGTGGCAGCAATATTTATAAAGTGACGGCTCGCGCCTA 
1741 + h + + + + 1800 

ATAAAACGGACGAATACACGTTCCACCGTCGTTATAAATATTTCACTGCCGAGCGCGGAT 
a YFACLCARWQQYL*SDGSRI> 
b ILPAYVQGGSN IYKVTARAY- 

c FCLLMCKVAAI FIK * RLAPM- 



TGACCGTAATGGCAATAGCTCTAACAATGTACAGCTTACTATTACCGTTCTGTCGAATGG 
1801 + + + + + + I860 

ACTGGCATTACCGTTATCGAGATTGTTACATGTCGAATGATAATGGCAAGACAGCTTACC . 
a *P*WQ*Ii*QCTAYYYRSVEW 
b DRNGNSSNNVQLTITVLSNG- 
C TVMAIALTMYSLLLP FCRMV- 
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TCAAGTTGTCGACCAGGTTGGGGTAACGGACTTTACGGCGGATAAGACTTCGGCTAAAGC 

1861 + + + + + + 1920 

AGTTCAACAGCTGGTCCAACCCCATTGCCTGAAATGCCGCCTATTCTGAAGCCGATTTCG 

a SSCRPGWGNGLYGG*DFG*S 

b QVVDQVGVTDFTADKTSAKA- 

C KLS TRLG *RTLRRIRLRI*KR- 



GGATAACGCCGATACCATTACTTATACCGCGACGGTGAAAAAGAATGGGGTAGCTCAGGC 
192 i + + + + + 1980 

CCTATTGCGGCTATGGTAATGAATATGGCGCTGCCACTTTTTCTTACCCCATCGAGTCCG 
a G*R RYHYLYRDGEKEWGSSG 
b DNADTITYTATVKKNGVAQA- 
c iTPIPLIiIPRR*KRMG*LRL- 



TAATGTCCCTGTTTCATTTAATATTGTTTCAGGAACTGCAACTCTTGGGGCAAATAGTGC 
1981 + + + + + + 2040 

ATTACAGGGACAAAGTAAATTATAACAAAGTCCTTGACGTTGAGAACCCCGTTTATCACG 
a ★CPCFI*YCFRNCNSWGK*C 
b NVPVSFN IVSGTATLGANSA- 

c MS LFHLIIjF QELQLIjGQ I V P - 



CAAAACGGATGCTAACGGTAAGGCAACCGTAACGTTGAAGTCGAGTACGCCAGGACAGGT 

2041 + + +— + + + 2100 

GTTTTGCCTACGATTGCCATTCCGTTGGCATTGCAACTTCAGCTCATGCGGTCCTGTCCA 

a QNGC*R*GNRNVEVEYARTG 

b KTDANGKATVTLKSS TPGQV- 

c KRMLTVRQP * R * SRVRQ DR S - 



CGTCGTGTCTGCTAAAACCGCGGAGATGACTTCAGCACTTAATGCCAGTGCGGTTATATT 

2101 + + + +" + + 2160 

GCAGCACAGACGATTTTGGCGCCTCTACTGAAGTCGTGAATTACGGTCACGCCAATATAA 

a RRVC*NRGDDFST*CQCGY I 

b VVSAKTAEMTSALNASAVIF- 

C S C LLKPRR *LQHLMPVRLY F - 



TTTTGATCAAACCAAGGCCAGCATTACTGAGATTAAGGCTGATAAGACAACTGCAGTAGC 

+ + + + + + 

AAAACTAGTTTGGTTCCGGTCGTAATGACTCTAATTCCGACTATTCTGTTGACGTCATCG 



a F * SNQGQHY*D*G** DNCS S 
b FDQTKAS ITEIKADKTTAVA- 

c LIKPRPALLRLRLIRQLQ* Q - 
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AAATGGTAAGGATGCTATT^^^ATACTGTAAAAGTTATGAAAAACGGTCA^MLGTTAA 

2221 + + + + + 2280 

TTTACCATTCCTACGATAATTTATATGACATTTTCAATACTTTTTGCCAGTCGGTCAATT 
a XW*GCY*IYCKSYEKRSAS* 
b NGKDAIKYTVKVMKNGQPVN- 
C MVRMIiLNIL*KIi*KTVSQLI- 



TAATCAATCCGTTACATTCTCAACAAACTTTGGGATGTTCAACGGTAAGTCTCAAACGCA 
2281 + + + + + + 2340 

ATTAGTTAGGCAATGTAAGAGTTGTTTGAAACCCTACAAGTTGCCATTCAGAGTTTGCGT 
a *SIRYILNKLWDVQR*VSNA 
b NQSVTFSTNFGMFNGKSQTQ- 
c INPLHSQQTLG CSTVSLKRK- 



AGCAACCACGGGAAATGATGGTCGTGCGACGATAACACTAACTTCCAGTTCCGCCGGTAA 
2341 + + + + + + 2400 

TCGTTGGTGCCCTTTACTACCAGCACGCTGCTATTGTGATTGAAGGTCAAGGCGGCCATT 
a SNHGK*WSCDDNTNFQFRR* 
b ATTGNDGRATITLTSSSAGK- 
c QPREMMVVRR*H*LPVPPVK- 



AGCGACTGTTAGTGCGACAGTCAGTGATGGGGCTGAGGTTAAAGCGACTGAGGTCACTTT 
2401 + + + + + + 2460 

TCGCTGACAATCACGCTGTCAGTCACTACCCCGACTCCAATTTCGCTGACTCCAGTGAAA 
a SDC*CDSQ*WG*G*SD*GHF 
b ATVSATVSDGAEVKATEVTF 
C RLLVRQSVMGLRLKRLRSLF- 



TTTTGATGAACTGAAAATTGACAACAAGGTTGATATTATTGGTAACAATGTCAAGAGGTC 
2461 + + + + + + 2520 

AAAACTACTTGACTTTTAACTGTTGTTCCAACTATAATAACCATTGTTACAGTTCTCCAG 
a F**TEN*QQG*YYW*QCQEV 
b FDELKIDNKVDI IGNNVKRS - 

c LMN*KLTTRLILLVTMSRGR- 



GATGTTGCCTAATATTTGGCTGCAATATGGTCAGTTTAAACTGAAAGCAAGCGGTGGTGA 
2521 + + + + + + 2580 

CTACAACGGATTATAAACCGACGTTATACCAGTCAAATTTGACTTTCGTTCGCCACCACT 
a DVA*YLAAIWSV*TESKRW* 
b MLPNIWLQYGQFKLKASGGD- 
C C C It I FGC NMV S LN * K QAVV M - 
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TGGTACATATTCATGGTATTCAGAAAATACCAGTATCGCGACTGTCGATGCATCAGGGAA 
2581 + + + + + + 2640 

ACCATGTATAAGTACCATAAGTCTTTTATGGTCATAGCGCTGACAGCTACGTAGTCCCTT 
a WYIFMVFRKYQYRDCRCIRE 
b GTYSWYSENTS IATVDASGK- 

c VHI HGIQKIPVSRLSMHQGK- 



AGTCACTTTGAATGGTAAAGGCAGTGTCGTAATTAAAGCCACATCTGGTGATAAGCAAAC 
2641 + + + + + + 2700 

TCAGTGAAACTTACCATTTCCGTCACAGCATTAATTTCGGTGTAGACCACTATTCGTTTG 
a SHFEW*RQCRN*SHIW**AN 
b VTLNGKGSVVIKATSGDKQT- 
C SL*MVKAVS*LKPHLVISKQ- 



AGTAAGTTACACTATAAAAGCACCGTCGTATATGATAAAAGTGGATAAGCAAGCCTATTA 

2701 - + + + + + + 2760 

TCATTCAATGTGATATTTTCGTGGCAGCATATACTATTTTCACCTATTCGTTCGGATAAT 
a SKLHYKSTVVYDKSG*ASLL 
b VSYTIKAPSYMIKVDKQAYY- 
c *VTIi*KHRRI**KWISKPIM- 



TGCTGATGCTATGTCCATTTGCAAAAATTTATTACCATCCACACAGACGGTATTGTCAGA 
2761 + + + + + + 2820 

ACGACTACGATACAGGTAAACGTTTTTAAATAATGGTAGGTGTGTCTGCCATAACAGTCT 
a C*CYVHLQKFITIHTDGIVR 
b ADAMS ICKNIiLPSTQTVLS D - 

c LMLCPFAKIYYHPHRRYCQI- 



TATTTATGACTCATGGGGGGCTGCAAATAAATATAGCCATTATAGTTCTATGAACTCAAT 

2821 + + + + + '-+ 2880 

ATAAATACTGAGTACCCCCCGACGTTTATTTATATCGGTAATATCAAGATACTTGAGTTA 
a YL*LMGGCK*I*PL*FYEIiN 
b IYDSWGAANKYSHYSSMNSI- 
c FMTHGGI»QINIAIIVL*TQ*- 



AACTGCTTGGATTAAACAGACATCTAGTGAGCAGCGTTCTGGAGTATCAAGCACTTATAA 

2881 + + + + + + 2940 

TTGACGAACCTAATTTGTCTGTAGATCACTCGTCGCAAGACCTCATAGTTCGTGAATATT 
a NCIiD*TDI * *AAFWSIKHL* 

b TAWIKQTSSEQRSGVSSTYM- 
C LLGLNRHLVS SVLEYQAL I T - 
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CCTAATAACACAAAACCCTCTTCCTGGGGTTAATGTTAATACTCCAAATGTCTATGCGGT 
2941 + + + + + + 3000 

GGATTATTGTGTTTTGGGAGAAGGACCCCAATTACAATTATGAGGTTTACAGATACGCCA 
a PNNTKPSSWG*C*YSKCLCG 
b LITQNPLPGVNVNTPNVYAV 
C * *HKTLFLGLML ILQMSMRF- 



TTGTGTAGAATAATTCCATAACCACCCCGGCTAAAATATGTATTGTTTTAGTCGGGGCAT 
3001 + + + + -+ + 3060 

AACACATCTTATTAAGGTATTGGTGGGGCCGATTTTATACATAACAAAATCAGCCCCGTA 
a LCRIIP*PPRLKYVLF*SGH 
b CVE*FHNHPG*NMYCFSRGI- 
c V*NNSITTPAKICIVLVGA*- 



AATTATTTCTTCTTAAGAAATAACCCTCTTATAATCAAATCTACTACTGGTCTTTTTATC 
3061 + + + + + + 3120 

TTAATAAAGAAGAATTCTTTATTGGGAGAATATTAGTTTAGATGATGACCAGAAAAATAG 
a NYFFLRNNPLIIKSTTGLFI 
b IISS*EITLL*SNLLLVFLS- 
C LFLLKK*PSYNQIYYWSFYL- 



TGCTTAATAGG 

3121 +- 3131 

ACGAATTATCC 

a C L I 
b A * * 

c L N R 



8. INFORMATION FOR SEQ ID NO: 2: 

(a) SEQUENCE CHARACTERISTICS: 

(i) LENGTHS 31 base pairs 

(ii) TYPE: nucleic acid 

(iii) STRANDEDNESS : single 

( iv ) TOPOLOGY : 1 inear 

(b) MOLECULAR TYPE: DNA (genomic) 

(c) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

5 ■ ATG GAA TTC TCG TCA CAG TTG CAG GCC TGG T 3' 
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9. INFORMATION FOR SEQ ID NO: 3: 

(a) SEQUENCE CHARACTERISTICS: 

(i) LENGTH: 32 base pairs 

(ii) TYPE: nucleic acid 

(iii) STRANDEDNESS : single 

( iv ) TOPOLOGY : 1 inear 

(b) MOLECULAR TYPE: DNA (genomic) 

(c) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



5'ATG GAA TTC CGA AGT CTT ATC AGC CGT AAA GT 



10. INFORMATION FOR SEQ ID NO: 4: 

(a) SEQUENCE CHARACTERISTICS: 

(i) LENGTH: 29 base pairs 

(ii) TYPE: nucleic acid 

(iii) STRANDEDNESS: single 

( iv ) TOPOLOGY : linear 

(b) MOLECULAR TYPE: DNA (genomic) 

(c) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



5 ' ATG GAA TTC GGA TGT TCA ACG GTA AGT CT 3 ' 



11. INFORMATION FOR SEQ ID NO: 5: 

(a) SEQUENCE CHARACTERISTICS: 

(i) LENGTH: 29 base pairs 

(ii) TYPE: nucleic acid 

( iii ) STRANDEDNESS : single 

( iv ) TOPOLOGY : 1 inear 

(b) MOLECULAR TYPE: DNA (genomic) 

(c) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



5' ATG GAA TTC ACA ATA CCG TCT GTG TGG AT 3' 
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12. INFORMATION FOR SEQ j3tNO:6: 

(a) SEQUENCE CHARACTERISTICS: 

(i) LENGTH: 17 base pairs 

(ii) TYPE: nucleic acid 

(iii) STRANDEDNESS : single 

( iv ) TOPOLOGY : 1 inear 

(b) MOLECULAR TYPE: DNA (genomic) 

(c) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

5 ' AAG CGA CTG AGG TCA CT 3 1 

13. INFORMATION FOR SEQ ID NO:7: 

(a) SEQUENCE CHARACTERISTICS: 

(i) LENGTH: 18 base pairs 

(ii) TYPE: nucleic acid 

(iii) STRANDEDNESS: single 

( i v ) TOPOLOGY : 1 inear 

(b) MOLECULAR TYPE: DNA (genomic) 

(c) SEQUENCE DESCRIPTION: SEQ ID NO: 7; 

5'ACG CTG CTC ACT AGA TGT 3' 

14. INFORMATION FOR SEQ ID NO:8: 

(a) SEQUENCE CHARACTERISTICS: 

(i) LENGTH: 17 base pairs 

(ii) TYPE: nucleic acid 

(iii) STRANDEDNESS: single 

( i v ) TOPOLOGY : 1 inear 

(b) MOLECULAR TYPE: DNA (genomic) 

(c) SEQUENCE DESCRIPTION: SEQ ID NO: 8 

5 ' AGA CCT AGG TTA CAT TT 3 * 
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15. x WFORMATION FOR SEQ ID NO: 9: 



2078716 



(a) SEQUENCE CHARACTERISTICS: 

(i) LENGTH: 18 base pairs 

(ii) TYPE: nucleic acid 

(iii) STRANDEDNESS : single 

(iv) TOPOLOGY: linear 

(b) MOLECULAR TYPE: DNA (genomic) 

(c) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



5 ' TAT TTT ATC AGC TTC AGT 3 * 
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xriE EMBODIMENTS OF THE INVENTION IN WHICH AN EXCLUSIVE PROPERTY 
OR PRIVILEGE IS CLAIMED ARE DEFINED AS FOLLOWS s 

1 . A purified and isolated DNA segment having a sequence which 
codes for a protein associated with attaching and effacing 
activity of enterohemorrhagic E.coli . or an oligonucleotide 
fragment of the DNA segment. 

2. A purified and isolated DNA segment as claimed in claim 1 
having a sequence which codes for a protein associated with 
attaching and effacing activity of enterohemorrhagic E »coli which 
has substantial homology with the amino acid sequence as shown 
in the Sequence Listing as SEQ ID NO:l. 

3. A purified and isolated DNA segment as claimed in claim 1 
having a sequence having substantial sequence homology with the 
nucleotide sequence as shown in the Sequence Listing as SEQ ID 
NO:l. 

4* A purified and isolated DNA segment as claimed in claim 1 
having a sequence having more than ? sequence homology with the 
nucleotide sequence as shown in the Sequence Listing as SEQ ID 
NO:l. 

5. A purified and isolated single stranded nucleotide sequence 
comprising a DNA segment as claimed in claim 2, 

6. An antisense sequence of a DNA segment as claimed in claim 
1 or an oligonucleotide fragment of the antisense sequence, 

7. A recombinant molecule adapted for transformation of a host 
cell comprising a DNA segment or an oligonucleotide fragment of 
the DNA segment as claimed in claim 1 and an expression control 
sequence operatively linked to the DNA segment. 

8. A transformant host cell including a recombinant molecule 
as claimed in claim 7 . 
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A method for preparing a protein associated with attaching 
and effacing activity of enterohemorrhagic g,.coli utilizing a 
purified and isolated DNA segment as claimed in claim 2. 

10. A substantially pure protein associated with attaching and 
effacing activity of enterohemorrhagic E.coli. 

11. A substantially pure protein as claimed in claim 10 having 
an amino acid sequence which has substantial homology with the 
amino acid sequence as shown in the Sequence Listing as SEQ ID 
N0:1. 

12 . A protein encoded by the purified and isolated DNA segment 
as claimed in claim 2. 

13. A monoclonal or polyclonal antibody specific for an epitope 
of a protein as claimed in claim 10. 

14. An antibody as claimed in claim 13 which binds a distinct 
epitope in an unconserved region of the protein. 

15. A purified and isolated nucleotide sequence comprising the 
sequence shown in the Sequence Listing as SEQ ID NO: 2 or the 
sequence shown in the Sequence Listing as SEQ ID NO: 3, or a 
sequence having substantial homology thereto or a fragment of the 
nucleotide sequence. 

16. A purified and isolated nucleotide sequence comprising the 
sequence shown in the Sequence Listing as SEQ ID NO: 4 or the 
sequence shown in the Sequence Listing as SEQ ID NO: 5, or a 
sequence having substantial homology thereto or a fragment of the 
nucleotide sequence . 

17. A purified and isolated nucleotide sequence comprising the 
sequence shown in the Sequence Listing as SEQ ID NO: 6 or the 
sequence shown in the Sequence Listing as SEQ ID NO: 7, or a 
sequence having substantial homology thereto or a fragment of the 
nucleotide sequence. 
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±o. A purified and isolated nucleotide sequence comprising the 
sequence shown in the Sequence Listing as SEQ ID NO: 8 or the 
sequence shown in the Sequence Listing as SEQ ID NO: 9, or a 
sequence having substantial homology thereto or a fragment of the 
nucleotide sequence. 

19. A diagnostic kit for detecting enterohemorrhagic E.coli 0157 
in a sample comprising a monoclonal antibody as claimed in claim 
13 and directions for its use. 

20. An immunoassay for the detection of E.coli 0157 comprising 
contacting a sample suspected of containing E.coli 0157 with a 
monoclonal antibody as claimed in claim 13 to form an immune 
complex; and determining the presence of the immune complex in 
order to detect E.coli 0157 in the sample. 

21. An avirulent strain of an enterohemorrhagic E.coli 
comprising an avirulent bacterial carrier strain transformed with 
a recombinant molecule as claimed in claim 7. 

22. A vaccine composition comprising a bacterial carrier strain 
transformed with a recombinant molecule as claimed in claim 7. 

23. A method for detecting the presence of a DNA segment having 
a sequence encoding a protein related to or analogous to a 
protein associated with attaching and effacing activity of 
enterohemorrhagic E.coli or an oligonucleotide fragment thereof 
in a sample, comprising contacting the sample with a nucleotide 
probe capable of hybridizing with the DNA segment or an 
oligonucleotide fragment thereof, to form a hybridization 
product, under conditions which permit the formation of the 
hybridization product, and assaying for the hybridization 
product . 

24. A kit for detecting the presence of a DNA segment having a 
sequence encoding a protein related to or analogous to a protein 
associated with attaching and effacing activity of 
enterohemorrhagic E.coli or an oligonucleotide fragment thereof, 
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xn a sample comprisSl? a nucleotide probe capable o^%rbridizing 
with the DNA segment or an oligonucleotide fragment thereof, 
reagents required for hybridization of the nucleotide probe with 
the DNA segment or an oligonucleotide fragment thereof, and 
directions for its use. 

25- A method of determining the presence of a DNA segment having 
a sequence encoding a protein associated with attaching and 
effacing activity of enterohemorrhagic E.coli or a predetermined 
oligonucleotide fragment thereof in a sample, comprising treating 
the sample with primers which are capable of amplifying the DNA 
segment or the predetermined oligonucleotide fragment thereof in 
a polymerase chain reaction to form amplified sequences, under 
conditions which permit the formation of amplified sequences , and 
assaying for amplified sequences. 

26. A kit for determining the presence of a DNA segment having 
a sequence encoding a protein associated with attaching and 
effacing activity of enterohemorrhagic E.coli or a predetermined 
oligonucleotide fragment thereof in a sample, comprising primers 
which are capable of amplifying the DNA segment or the 
predetermined oligonucleotide fragment thereof in a polymerase 
chain reaction to form amplified sequences, means for assaying 
the amplified sequences, and directions for its use. 

27. A method for detecting verotoxin-producing E.coli of the 
serogroups 05, 026, 0103, 0111, 0118, 0145, and 0157 in a sample 
comprising contacting the sample with a Sall-StuI fragment (Cl- 
C2) comprising the sequence as shown in the Sequence Listing as 
SEQ ID NO: 2 and the sequence as shown in the Sequence Listing as 
SEQ ID NO: 3, under conditions which permit the fragment to 
hybridize with a complementary sequence in the sample to form a 
hybridization product, and assaying for the hybridization 
product . 

28. A method for detecting verotoxin-producing E.coli serogroup 
0157 in a sample comprising contacting the sample with a 0.5 kb 
fragment (A3-B2) comprising the sequence as shown in the Sequence 
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Lxsting as SEQ ID NO: 4 and the sequence as shown in the Sequence 
Listing as SEQ ID NO: 5, under conditions which permit the 
fragment to hybridize with a complementary sequence in the sample 
to form a hybridization product, and assaying for the 
hybridization product. 

29 - A method for detecting pathogenic verotoxin-producing E.coli 
strains of the serogroup 0157 in a sample comprising treating the 
sample with the nucleotide sequence P1EH shown in the Sequence 
Listing as SEQ ID NO: 4 and the nucleotide sequence P2EH5 shown 
in the Sequence Listing as SEQ ID NO: 5 , in the polymerase chain 
reaction, under conditions which permit the formation of 
amplified sequences which terminate with the nucleic acid 
sequence of one of the primers and the complementary nucleic acid 
sequence of the other of the primers; and, assaying for the 
amplified sequences. 

30. A kit for detecting verotoxin-producing E.coli of the 
serogroups 05, 026, 0103, 0111, 0118, 0145, and 0157 in a sample 
comprising a Sall-StuI fragment (C1-C2) comprising a sequence as 
shown in the Sequence Listing as SEQ ID NO: 2 and the sequence as 
shown the Sequence Listing as SEQ ID NO: 3, reagents required for 
hybridization of the fragment with a DNA segment having a 
sequence encoding the attaching and effacing protein of 
enterohemorrhagic E.coli or an oligonucleotide fragment thereof, 
and directions for its use. 

31. A kit for detecting verotoxin-producing E.coli of the 
serogroup 0157 in a sample comprising a 0.5 kb fragment (A3-B2) 
comprising a sequence as shown in the Sequence Listing as SEQ ID 
NO: 3 and the sequence as shown in the Sequence Listing as SEQ ID 
NO: 4, reagents required for hybridization of the fragment with 
a DNA segment encoding the attaching and effacing protein of 
enterohemorrhagic E.coli or an oligonucleotide fragment thereof, 
and directions for its use. 

32 . A kit for detecting pathogenic verotoxin-producing E.coli 
strains of the serogroup 0157 in a sample comprising the primers 
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t quence Listing as SEQ ID NO: Mid P2EH as 

shown in the Sequence Listing as SEQ ID NO: 6, reagents required 
for the amplification in a PCR reaction of sequences which 
terminate with the nucleic acid sequence of one of the primers 
and the complementary nucleic acid sequence of the other of the 
primers, means for assaying amplified sequences, and directions 
for its use. 
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